A Grammar Based Approach To A Grammar Checking Of Free Word Order Languages
نویسندگان
چکیده
This paper shows one of the methods used for grammar checking, as it is being developed in the frame of the EC funded project LATESLAV -Language Technology li)r Slavic Languages (PECO 2824). The languages under consideration in the project Czech and Bulgarian are both free word order languages, therefore it is not sufficient to use only simple pattern based methods for error checking. The emphasis is on grammar-based methods, which are much closer to parsing than pattern-based methods. It is necessary to stress that we are dealing with a surface syntactic analysis. Therefore also the errors which are taken into consideration are surface syntactic errors. Our system for identification and localization of (surl:ace) syntactic errors consists of two basic modules the module of lexical analysis and the module of surface syntax checking. In the present paper, we will describe the second module, which is more complicated and creates the core of the whole system. Although it is not crucial for our method, we would like to point out that our approach to the problems of grammar checking is based on dependency syntax. Let us illustrate the degree of licedom of the word order, which is provided by Czech, one of the languages under consideration in the project. If we take a sentence like "Oznaeen3~ (Adj. masc., Nom/Gen Sg.) soubor (N masc., Nom/Gen Sg.) se (Pron.) nepodafilo (V neutr., 3rd pers. Sg) tisp6~nE (Adv.) otev~ft (V inf.)" (The marked file failed to be opened sucessfully); word-tbr-word translation into English "Marked file itself failed succesfully to open", we may modify the word order for instance in the following way:
منابع مشابه
On Word and Frontier Languages of Unsafe Higher-Order Grammars
Higher-order grammars are an extension of regular and context-free grammars, where nonterminals may take parameters. They have been extensively studied in 1980’s, and restudied recently in the context of model checking and program verification. We show that the class of unsafe order-(n+1) word languages coincides with the class of frontier languages of unsafe order-n tree languages. We use inte...
متن کاملInformation Structure in Topological Dependency Grammar
Topological Dependency Grammar (TDG) is a lexicalized dependency grammar formalism, able to model languages with a relatively free word order. In such languages, word order variation often has an important function: the realization of information structure. The paper discusses how to integrate information structure into TDG, and presents a constraint-based approach to modelling information stru...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملThe Impact of Structured Input-based Tasks on L2 Learners’ Grammar Learning
Abstract Task-based language teaching has received increased attention in second language research. However, the combination of structured input-based approach and task-based language teaching has not been examined in relation to L2 grammar learning. To address this gap, the present study investigated how the structured input-based tasks with and without explicit information impacted learners’ ...
متن کاملThe Impact of Structured Input-based Tasks on L2 Learners’ Grammar Learning
Abstract Task-based language teaching has received increased attention in second language research. However, the combination of structured input-based approach and task-based language teaching has not been examined in relation to L2 grammar learning. To address this gap, the present study investigated how the structured input-based tasks with and without explicit information impacted learners’ ...
متن کامل